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SEQUENCE LISTING 



(1) GENERAL INFORMATION 

(i) APPLICANT: Goli, Surya K. 

Hillman, Jennifer L. 

(ii) TITLE OF THE INVENTION: A NOVEL GLUTATHIONE S -TRANSFERASE 

(iii) NUMBER OF SEQUENCES: 5 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: INCYTE GENOMICS, INC. 

(B) STREET: 3160 Porter Drive 

(C) CITY: Palo Alto 

(D) STATE: CA 

(E) COUNTRY: US 

(F) ZIP: 94304 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 
(D> SOFTWARE: FastSEQ 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 09/309,320 

(B) FILING DATE: May 11, 1999 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 09/309,320 

(B) FILING DATE: May 11, 1999 

(A) APPLICATION NUMBER: 09/096,571 

(B) FILING DATE: June 12, 1998 

(A) APPLICATION NUMBER: 08/756,771 

(B) FILING DATE: November 26, 1996 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME : Billings, Lucy J. 

(B) REGISTRATION NUMBER: 36,749 

(C) REFERENCE /DOCKET NUMBER: PF-0162 US 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 415-855-0555 

(B) TELEFAX: 415-845-4166 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 222 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: Consensus 

(B) CLONE: Consensus 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 



Met 


Ala 


Ala 


Arg 


Pro 


Lys 


Leu 


His 


Tyr 


Pro 


Asn 


Gly 


Arg 


Gly 


Arg 


Met 


1 








5 










10 










15 




Glu 


Ser 


Val 


Arg 


Trp 


Val 


Leu 


Ala 


Ala 


Ala 


Gly 


Val 


Glu 


Phe 


Asp 


Glu 








20 










25 










30 






Glu 


Phe 


Leu 


Glu 


Thr 


Lys 


Glu 


Gin 


Leu 


Tyr 


Lys 


Leu 


Gin 


Asp 


Gly 


Asn 






35 










40 










45 








His 


Leu 


Leu 


Phe 


Gin 


Gin 


Val 


Pro 


Met 


Val 


Glu 


He 


Asp 


Gly 


Met 


Lys 




50 










55 










60 










Leu 


Val 


Gin 


Thr 


Arg 


Ser 


He 


Leu 


His 


Tyr 


He 


Ala 


Asp 


Lys 


His 


Asn 


65 










70 










75 










80 


Leu 


Phe 


Gly 


Lys 


Asn 


Leu 


Lys 


Glu 


Arg 


Thr 


Leu 


He 


Asp 


Met 


Tyr 


Val 










85 










90 










95 




Glu 


Gly 


Thr 


Leu 


Asp 


Leu 


Leu 


Glu 


Leu 


Leu 


He 


Met 


His 


Pro 


Phe 


Leu 








100 










105 










110 






Lys 


Pro 


Asp 


Asp 


Gin 


Gin 


Lys 


Glu 


Val 


Val 


Asn 


Met 


Ala 


Gin 


Lys 


Ala 






115 










120 










125 








He 


He 


Arg 


Tyr 


Phe 


Pro 


Val 


Phe 


Glu 


Lys 


He 


Leu 


Arg 


Gly 


His 


Gly 




130 










135 










140 










Gin 


Ser 


Phe 


Leu 


Val 


Gly 


Asn 


Gin 


Leu 


Ser 


Leu 


Ala 


Asp 


Val 


He 


Leu 


145 










150 










155 










160 


Leu 


Gin 


Thr 


He 


Leu 


Ala 


Leu 


Glu 


Glu 


Lys 


He 


Pro 


Asn 


He 


Leu 


Ser 










165 










170 










175 




Ala 


Phe 


Pro 


Phe 


Leu 


Gin 


Glu 


Tyr 


Thr 


Val 


Lys 


Leu 


Ser 


Asn 


He 


Pro 








180 










185 










190 






Thr 


He 


Lys 


Arg 


Phe 


Leu 


Glu 


Pro 


Gly 


Ser 


Lys 


Lys 


Lys 


Pro 


Pro 


Pro 






195 










200 










205 








Asp 


Glu 


He 


Tyr 


Val 


Arg 


Thr 


Val 


Tyr 


Asn 


He 


Phe 


Arg 


Pro 








210 










215 










220 











(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 800 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: Consensus 

(B) CLONE: Consensus 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

ATGGCAGCAA GGCCCAAGCT CCACTATCCC AACGGAAGAG GCCGGATGGA GTCCGTGAGA 60 

TGGGTTTTAG CTGCCGCCGG AGTCGAGTTT GATGAAGAAT TTCTGGAAAC AAAAGAACAG 120 

TTGTACAAGT TGCAGGATGG TAACCACCTG CTGTTCCAAC AAGTGCCCAT GGTTGAAATT 180 

GACGGGATGA AGTTGGTACA GACCCGAAGC ATTCTCCACT ACATAGCAGA CAAGCACAAT 240 

CTCTTTGGCA AGAACCTCAA GGAGAGAACC CTGATTGACA TGTACGTGGA GGGGACACTG 300 

GATCTGCTGG AACTGCTTAT CATGCATCCT TTC TTAAAAC CAGATGATCA GCAAAAGGAA 360 

GTGGTTAACA TGGCCCAGAA GGCTATAATT AGATACTTTC CTGTGTTTGA AAAGATTTTA 42 0 

AGGGGTCACG GACAAAGCTT TCTTGTTGGT AATCAGCTGA GCCTTGCAGA TGTGATTTTA 480 

CTCCAAACCA TTTTAGCTCT AGAAGAGAAA ATTCCTAATA TCCTGTCTGC ATTTCCTTTC 540 

CTCCAGGAAT ACACAGTGAA ACTAAGTAAT ATCCCTACAA TTAAGAGATT CCTTGAACCT 600 

GGCAGCAAGA AGAAGCCTCC CCCTGATGAA ATTTATGTGA GAACCGTCTA CAACATCTTT 660 

AGGCCATAAA AC AAC AC ATC CATGTGTGAG TGACAGTGTG TTCCTAGAGA TGGTATTGTC 720 
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TACAGTCATG TCTTAATGGA TCCCAGCTCT GTCATGGTGC TATCTATGTA TTAAGTTGGG 780 
TCCTAAGTTG GGTCTTTTGT 800 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 222 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: GenBank 

(B) CLONE: 825605 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



Met 


Ala 


Glu 


Lys 


Pro 


Lys 


Leu 


His 


Tyr 


Ser 


Asn 


Thr 


Arg 


Gly 


Arg 


Met 


1 








5 










10 










15 




Glu 


Ser 


He 


Arg 


Trp 


Leu 


Leu 


Ala 


Ala 


Ala 


Gly 


Val 


Glu 


Phe 


Glu 


Glu 








20 










25 










30 






Lys 


Phe 


He 


Lys 


Ser 


Ala 


Glu 


Asp 


Leu 


Asp 


Lys 


Leu 


Arg 


Asn 


Asp 


Gly 






35 










40 










45 








Tyr 


Leu 


Met 


Phe 


Gin 


Gin 


Val 


Pro 


Met 


Val 


Glu 


He 


Asp 


Gly Met 


Lys 




50 










55 










60 










Leu 


Val 


Gin 


Thr 


Arg 


Ala 


He 


Leu 


Asn 


Tyr 


He 


Ala 


Ser 


Lys 


Tyr 


Asn 


65 










70 










75 










80 


Leu 


Tyr 


Gly 


Lys 


Asp 


He 


Lys 


Glu 


Lys 


Ala 


Leu 


He 


Asp 


Met 


Tyr 


He 










85 










90 










95 




Glu 


Gly 


He 


Ala 


Asp 


Leu 


Gly 


Glu 


Met 


He 








Pro 


Phe 


Thr 








100 










105 










110 






Gin 


Pro 


Glu 


Glu 


Gin 


Asp 


Ala 


Lys 


Leu 


Ala 


Leu 


He 


Gin 


Glu 


Lys 


Thr 






115 










120 










125 








Lys 


Asn 


Arg 


Tyr 


Phe 


Pro 


Ala 


Phe 


Glu 


Lys 


Val 


Leu 


Lys 


Ser 


His 


Gly 




130 










135 










140 










Gin 


Asp 


Tyr 


Leu 


Val 


Gly 


Asn 


Lys 


Leu 


Ser 


Arg 


Ala 


Asp 


He 


His 


Leu 


145 










150 










155 










160 


Val 


Glu 


Leu 


Leu 


Tyr 


Tyr 


Val 


Glu 


Glu 


Leu 


Asp 


Ser 


Ser 


Leu 


He 


Ser 










165 










170 










175 




Ser 


Phe 


Pro 


Leu 


Leu 


Lys 


Ala 


Leu 


Lys 


Thr 


Arg 


He 


Ser 


Asn 


Leu 


Pro 








180 










185 










190 






Thr 


Val 


Lys 


Lys 


Phe 


Leu 


Gin 


Pro 


Gly 


Ser 


Pro 


Arg 


Lys 


Pro 


Pro 


Met 






195 










200 










205 








Asp 


Glu 


Lys 


Ser 


Leu 


Glu 


Glu 


Ser 


Arg 


Lys 


He 


Phe 


Arg 


Phe 







210 215 220 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 222 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: GenBank 

(B) CLONE: 259141 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 
Met Ala Glu Lys Pro Lys Leu His Tyr Phe Asn Ala Arg Gly Arg Met 
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1 








5 










10 










15 




Glu 


Ser 


Thr 


Arg 


Trp 


Leu 


Leu 


Ala 


Ala 


Ala 


Gly Val 


Glu 


Phe 


Glu 


Glu 








20 










25 










30 






Lys 


Phe 


He 
35 


Lys 


Ser 


Ala 


Glu 


Asp 
40 


Leu 


Asp 


ys 


eu 


45 9 




Asp 


Gly 


Tyr 


Leu 


Met 


Phe 


Gin 


Gin 


Val 


Pro 


Met 


Val 


n 

u 


e 




Gly 


Met 


Lys 




50 










55 
















Leu 


Val 


Gin 


Thr 


Arg 


Ala 


He 


Leu 


Asn 


Tyr 


.. 

e 


Ala 


Ser 


Lys 


Tyr 


Asn 


65 










70 


















80 


Leu 


Tyr 


Gly 


Lys 


Asp 
85 


He 


Lys 


Glu 


Arg 


Ala 
90 


Leu 


He 


Asra 


Met 


Tyr 
95 


He 


Glu 


Gly 


He 


Ala 


Asp 


Leu 


Gly 


Glu 


Met 


He 






Leu 


Pro 


Val 


Cys 








100 










105 










110 




Pro 


Pro 


Glu 


Glu 


Lys 


Asp 


Ala 






Ala 








Glu 




He 






115 










120 










125 






Lys 


Asn 


Arg 


Tyr 


Phe 


Pro 


Ala 


Phe 


Glu 


Lys 








Ser 


His 


Gly 




130 










135 










140 








Gin 


Asp 


Tyr 




Val 


Gly 








Ser 








He 






145 










150 










155 










160 


Val 


Glu 


Leu 


Leu 


Tyr 
165 


Tyr 


Val 


Glu 


Glu 


170 


Asp 


Ser 


Ser 




He 
175 


Ser 


Ser 


Phe 


Pro 


Leu 
180 


Leu 


Lys 


Ala 


Leu 


Lys 
185 


Thr 


Arg 


He 


Ser 


Asn 
190 


Leu 


Pro 


Thr 


Val 


Lys 
195 


Lys 


Phe 


Leu 


Gin 


Pro 
200 


Gly 


Ser 


Pro 


Arg 


Lys 
205 


Pro 


Pro 


Met 


Asp 


Glu 
210 


Lys 


Ser 


Leu 


Glu 


Glu 
215 


Ala 


Arg 


Lys 


He 


Phe 
220 


Arg 


Phe 







(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: GenBank 

(B) CLONE: 193710 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



Met 


Ala 


Ala 


Lys 


Pro 


Lys 


Leu 


Tyr 


Tyr 


Phe 


Asn 


Gly 




Gly 


Arg 


Met 


1 








5 










10 










15 




Glu 


Ser 


He 




Trp 


Leu 


Leu 


Ala 


Ala 


Ala 


Gly 


Val 


Glu 


Phe 


Glu 


Glu 








20 g 










25 










30 






Glu 


Phe 


Leu 


Glu 


Thr 


Arg 


Glu 


Gin 


Tyr 


Glu 


Lys 


Met 


Gin 


Lys 


Asp 


Gly 






35 










40 










45 




His 


Leu 




Phe 


Gly 


Gin 


Val 


Pro 


Leu 


Val 


Glu 


He 


Asp 


Gly 


Met 


Met 




50 










55 










60 










Leu 


Thr 


Gin 


Thr 


Arg 


Ala 


He 


Leu 


Ser 


Tyr 


Leu 


Ala 


Ala 


Lys 


Tyr 


Asn 


65 










70 










75 






80 


Leu 


Tyr 


Gly 


Lys 


Asp 


Leu 


Lys 


Glu 


Arg 


Val 


Arg 


He 


Asp 


Met 


Tyr 


Ala 










85 










90 










95 




Asp 


Gly 


Thr 


Gin 


Asp 


Leu 


Met 


Met 


Met 


He 


Ala 


Val 


Ala 


Pro 


Phe 


Lys 








100 










105 










110 




Thr 


Pro 


Lys 


Glu 


Lys 


Glu 


Glu 


Ser 


Tyr 


Asp 


Leu 


He 


Leu 


Ser 


Arg 


Ala 






115 










120 










125 






Lys 


Thr 


Arg 


Tyr 


Phe 


Pro 


Val 


Phe 


Glu 


Lys 


He 


Leu 


Lys 


Asp 


His 


Gly 




130 










135 










140 






Glu 


Ala 


Phe 


Leu 


Val 


Gly 


Asn 


Gin 


Leu 


Ser 


Trp 


Ala 


Asp 


He 


Gin 


Leu 
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145 150 155 160 

Leu Glu Ala lie Leu Met Val Glu Glu Leu Ser Ala Pro Val Leu Ser 

165 170 175 

Asp Phe Pro Leu Leu Gin Ala Phe Lys Thr Arg lie Ser Asn lie Pro 

180 185 190 

Thr lie Lys Lys Phe Leu Gin Pro Gly Ser Gin Arg Lys Pro Pro Pro 

195 200 205 

Asp Gly Pro Tyr Val Glu Val Val Arg lie Val Leu Lys Phe 
210 215 220 
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